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SEQUENCE LISTING 



10 



15 



(1) GENERAL INFORMATION: 

<i) APPLICANT: Hsei, Vanessa 

Koumenis , Iphigenia 
Leong, Steven R. 
Presta, Leonard G. 
Shahrokh, Zahra 
Zapata, Gerardo A. 



(ii) TITLE OF INVENTION: Antibody Fragment -Polymer Conjugates and 

Humanized Anti-IL-8 Monoclonal Antibodies 

(iii) NUMBER OF SEQUENCES: 76 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Genentech, Inc . 

20 (B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE : California 

(E) COUNTRY: USA 

(F) ZIP: 94080 



25 



30 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE : WinPatin (Genentech) 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-Feb-1998 
35 (C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Love, Richard B. 

(B) REGISTRATION NUMBER: 34,659 

40 ( C ) REFERENCE /DOCKET NUMBER: P1085R3PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881 
45 (2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 
50 (C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



55 
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CAGTCCAACT GTTCAGGACG CC 22 
(2) INFORMATION FOR SEQ ID NO : 2 : 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



GTGCTGCTCA TGCTGTAGGT GC 22 

15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
20 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

25 

GAAGTTGATG TCTTGTGAGT GGC 23 
(2) INFORMATION FOR SEQ ID NO : 4 : 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



40 GCATCCTAGA GTCACCGAGG AGCC 24 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



CACTGGCTCA GGGAAATAAC CC 22 
55 (2) INFORMATION FOR SEQ ID NO:6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



10 GGAGAGCTGG GAAGGTGTGC AC 22 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 3 5 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



ACAAACGCGT ACGCTGACAT CGTCATGACC CAGTC 3 5 
25 (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: Nucleic Acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

35 

ACAAACGCGT ACGCTGATAT TGTCATGACT CAGTC 3 5 
(2) INFORMATION FOR SEQ ID NO: 9: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



ACAAACGCGT ACGCTGACAT CGTCATGACA CAGTC 35 

50 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 7 base pairs 
55 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
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(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

5 

GCTCTTCGAA TGGTGGGAAG ATGGATACAG TTGGTGC 37 
(2) INFORMATION FOR SEQ ID NO: 11: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 



20 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
25 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGATGGGCCC GGATAGACTG ATGGGGCTGT CGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO : 13 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



45 CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 3 9 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 
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CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 15: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
10 <D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



15 CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 3 9 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT TGTTTTGGC 3 9 
30 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 
35 (C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

40 

CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO : 18 : 

45 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUKNCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: Nucleic Acid 

5 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

10 

GACATTGTCA TGACACAGTC TCAAAAATTC ATGTCCACAT CAGTAGGAGA 50 
CAGGGTCAGC GTCACCTGCA AGGCCAGTCA GAATGTGGGT ACTAATGTAG 100 
15 CCTGGTATCA ACAGAAACCA GGGCAATCTC CTAAAGCACT GATTTACTCG 150 

TCATCCTACC GGTACAGTGG AGTCCCTGAT CGCTTCACAG GCAGTGGATC 200 
TGGGACAGAT TTCACTCTCA CCATCAGCCA TGTGCAGTCT GAAGACTTGG 250 

20 

CAGACTATTT CTGTCAGCAA TATAACATCT ATCCTCTCAC GTTCGGTCCT 300 

GGGACCAAGC TGGAGTTGAA ACGGGCTGAT GCTGCACCAC CAACTGTATC 350 

25 CATCTTCCCA CCATTCGAA 369 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 123 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Asp He Val Met Thr Gin Ser Gin Lys Phe Met Ser Thr Ser Val 
15 10 15 

Gly Asp Arg Val Ser Val Thr Cys Lys Ala Ser Gin Asn Val Gly 
20 25 30 

Thr Asn Val Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys 
35 40 45 

45 Ala Leu He Tyr Ser Ser Ser Tyr Arg Tyr Ser Gly Val Pro Asp 

50 55 60 

Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He 
65 70 75 



50 



55 



Ser His Val Gin Ser Glu Asp Leu Ala Asp Tyr Phe Cys Gin Gin 
80 85 90 

Tyr Asn He Tyr Pro Leu Thr Phe Gly Pro Gly Thr Lys Leu Glu 
95 100 105 
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Le u Lys Arg Ala Asp Ala Ala Pre Pro Thr Val Ser He Phe Pro 
110 115 

Pro Phe Glu 
5 123 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
10 ( A ) LENGTH: 417 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

TTCTATTGCT ACAAACGCGT ACGCTGAGGT GCAGCTGGTG GAGTCTGGGG 50 
GAGGCTTAGT GCCGCCTGGA GGGTC CCTG A AACTCTCCTG TGCAGCCTCT 100 
GGATTCATAT TCAGTAGTTA TGGCATGTCT TGGGTTCGCC AGACTCCAGG 150 
CAAGAGCCTG GAGTTGGTCG CAACCATTAA TAATAATGGT GATAGCACCT 200 
ATTATCCAGA CAGTGTGAAG GGCCGATTCA CCATCTCCCG AGACAATGCC 250 
AAGAACACCC TGTACCTGCA AATGAGCAGT CTGAAGTCTG AGGACACAGC 300 
CATGTTTTAC TGTGCAAGAG CCCTCATTAG TTCGGCTACT TGGTTTGGTT 350 
ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGCCAA AACAACAGCC 400 
CCATCTGTCT ATCCGGG 417 
(2) INFORMATION FOR SEQ ID NO -.22: 



20 



25 



30 



35 



40 



45 



50 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

Glu val Gin Leu Val Glu Ser Gly Gly Oly Leu Val Pro Pro Gly 



1 



5 1° 



Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe He Phe Ser 

20 25 

Ser Tyr Gly Met Ser Trp Val Arg Gin Thr Pro Gly Lys Ser Leu 
35 40 

Glu Leu Val Ala Thr He Asn Asn Asn Gly Asp Ser Thr Tyr Tyr 
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Pro Asp ser Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ala 

65 70 
Ly S Asn Thr Leu Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp 
5 80 

Thr Ala Met Phe Tyr Cys Ala Axg Ala Leu He Ser Ser Ala Thr 
95 100 

10 Trp Phe Gly Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ala 

110 115 

Ala Lys Thr Thr Ala Pro Ser Val Tyr Pro 
1 13 0 



15 



20 



25 



30 



40 



125 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ACAAACGCGT ACGCTGATAT CGTCATGACA G 31 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
35 (d) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 



GCAGCATCAG CTCTTCGAAG CTCCAGCTTG G 31 
(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



CCACTAGTAC GCAAGTTCAC G 21 
55 (2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

10 GATGGGCCCT TGGTGGAGGC TGCAGAGACA GTG 33 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 714 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
25 TGCTACAAAC GCGTACGCTG ATATCGTCAT GACACAGTCT CAAAAATTCA 100 

TGTCCACATC AGTAGGAGAC AGGGTCAGCG TCACCTGCAA GGCCAGTCAG 150 
AATGTGGGTA CTAATGTAGC CTGGTATCAA C AG AAAC C AG GGCAATCTCC 200 
TAAAGCACTG ATTTACTCGT CATCCTACCG GTACAGTGGA GTCCCTGATC 250 
GCTTCACAGG CAGTGGATCT GGGACAGATT TCACTCTCAC CATCAGCCAT 300 
GTGCAGTCTG AAGACTTGGC AGACTATTTC TGTCAGCAAT ATAACATCTA 350 
TCCTCTCACG TTCGGTCCTG GGAC CAAGCT GGAGCTTCGA AGAGCTGTGG 400 
CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 
GGAACTG CTT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
45 AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 

50 

GGGGAGAGTG TTAA 714 

(2) INFORMATION FOR SEQ ID NO: 28: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 
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30 



35 



40 
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(B) TYPE: Amino Acid 
<D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

5 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp He Val Met Thr Gin Ser 
10 20 25 30 

Gin Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser Val Thr 
35 40 45 

15 Cys Lys Ala Ser Gin Asn Val Gly Thr Asn Val Ala Trp Tyr Gin 

50 55 60 



20 



35 



50 



55 



Gin Lys Pro Gly Gin Ser Pro Lys Ala Leu He Tyr Ser Ser Ser 
65 70 75 

Tyr Arg Tyr Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser 
80 85 90 



Gly Thr Asp Phe Thr Leu Thr He Ser His Val Gin Ser Glu Asp 
25 95 100 105 

Leu Ala Asp Tyr Phe Cys Gin Gin Tyr Asn He Tyr Pro Leu Thr 
110 115 120 

30 Phe Gly Pro Gly Thr Lys Leu Glu Leu Arg Arg Ala Val Ala Ala 

125 130 135 



Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 

140 145 150 

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 

155 160 165 



Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 

40 170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 

185 190 195 

45 Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 

200 205 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 

215 220 225 



Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 237 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 756 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 5 0 

10 

TGCTACAAAC GCGTACGCTG AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT 100 
TAGTGCCGCC TGGAGGGTCC CTGAAACTCT CCTGTGCAGC CTCTGGATTC 150 
15 ATATTCAGTA GTTATGGCAT GTCTTGGGTT CGCCAGACTC CAGGCAAGAG 200 

CCTGGAGTTG GTCGCAACCA TTAATAATAA TGGTGATAGC ACCTATTATC 250 
CAGACAGTGT GAAGGGCCGA TTCACCATCT CCCGAGACAA TGCCAAGAAC 300 

20 

ACCCTGTACC TGCAAATGAG CAGTCTGAAG TCTGAGGACA CAGCCATGTT 350 
TTACTGTG C A AGAGCCCTCA TTAGTTCGGC TACTTGGTTT GGTTACTGGG 400 
25 GCCAAGGGAC TCTGGTCACT GTCTCTGCAG CCTCCACCAA GGGCCCATCG 450 

GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG GC AC AG CGGC 500 
CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 550 

30 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 600 
CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG 650 
35 CAGCTTGGGC ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA 700 

ACAC CAAGGT GGACAAGAAA GTTGAGCCCA AATCTTGTGA CAAAACTCAC 750 
ACATGA 756 

40 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 51 amino acids 
45 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

50 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 

55 

Gly Gly Gly Leu Val Pro Pro Gly Gly Ser Leu Lys Leu Ser Cys 
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35 40 45 

Ala Ala Ser Gly Phe lie Phe Ser Ser Tyr Gly Met Ser Trp Val 
50 55 60 

5 

Arg Gin Thr Pro Gly Lys Ser Leu Glu Leu Val Ala Thr lie Asn 
65 70 75 

Asn Asn Gly Asp Ser Thr Tyr Tyr Pro Asp Ser Val Lys Gly Arg 
10 80 85 90 

Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu Gin 
95 100 105 

15 Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Met Phe Tyr Cys Ala 

110 115 120 

Arg Ala Leu He Ser Ser Ala Thr Trp Phe Gly Tyr Trp Gly Gin 
125 130 135 



20 



35 



Gly Thr Leu Val Thr Val Ser Ala Ala Ser Thr Lys Gly Pro Ser 
140 145 150 



Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 
25 155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
170 175 180 

30 Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 

185 190 195 



Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 
200 205 210 

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 
215 220 225 



Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 
40 230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 251 

45 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 
50 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

55 

CAGTCCAACT GTTCAGGACG CC 22 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



GTGCTGCTCA TGCTGTAGGT GC 22 
15 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 
20 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

GAAGTTGATG TCTTGTGAGT GGC 23 
(2) INFORMATION FOR SEQ ID NO: 34: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



35 



40 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

GCATCCTAGA GTCACCGAGG AGCC 24 
(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
45 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CACTGGCTCA GGGAAATAAC CC 22 
(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 
<C) STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GGAGAGCTGG GAAGGTGTGC AC 22 

10 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
15 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CCAATGCATA CGCTGACATC GTGATG AC CC AGACCCC 3 7 
(2) INFORMATION FOR SEQ ID NO: 38: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



35 CCAATGCATA CGCTGATATT GTGATGACTC AGACTCC 37 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 37 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



CCAATGCATA CGCTGACATC GTGATGACAC AGACACC 37 
50 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 
55 (C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



5 AGATGTCAAT TGCTCACTGG ATGGTGGGAA GATGG 35 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 3 2 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



CAAACGCGTA CGCTGAGATC CAGCTGCAGC AG 32 
20 (2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 
25 (C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

30 

CAAACGCGTA CGCTGAGATT CAGCTCCAGC AG 32 
(2) INFORMATION FOR SEQ ID NO: 43: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 39 

45 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
50 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

55 
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CGATGGGCCC GGATAGACTG ATGGGGCTGT TGTTTTGGC 3 9 
(2) INFORMATION FOR SEQ ID NO: 45: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 91 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GATATCGTGA TGACACAGAC ACCACTCTCC CTGCCTGTCA GTCTTGGAGA 50 
TCAGGCCTCC ATCTCTTGCA GATCTAGTCA GAGCCTTGTA CACGGTATTG 100 
GAAACACCTA TTTACATTGG TACCTGCAGA AGCCAGGCCA GTCTCCAAAG 150 
CTCCTGATCT ACAAAGTTTC CAACCGATTT TCTGGGGTCC CAGACAGGTT 200 
CAGTGGCAGT GGATCAGGGA CAGATTTCAC ACTCAGGATC AGCAGAGTGG 250 
AGGCTGAGGA TCTGGGACTT TATTTCTGCT CTCAAAGTAC AC ATGTTC CG 300 
CTCACGTTCG GTGCTGGGAC CAAGCTGGAG CTGAAACGGG CTGATGCTG C 350 
AC C AACTGT A TCCATCTTCC CACCATCCAG TGAGCAATTG A 3 91 
(2) INFORMATION FOR SEQ ID NO: 48: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: Amino Acid 
5 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 

10 1 5 10 15 

Gly Asp Gin Ala Ser lie Ser Cys Arg Ser Ser Gin Ser Leu Val 

20 25 30 

15 His Gly lie Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 

35 40 45 

Gly Gin Ser Pro Lys Leu Leu lie Tyr Lys Val Ser Asn Arg Phe 

50 55 60 



20 



35 



45 



50 



Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 
€5 70 75 



Phe Thr Leu Arg lie Ser Arg Val Glu Ala Glu Asp Leu Gly Leu 

25 80 85 90 

Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly Ala 

95 100 105 

30 Gly Thr Lys Leu Glu Leu Lys Arg Ala Asp Ala Ala Pro Thr Val 

110 115 120 

Ser lie Phe Pro Pro Ser Ser Glu Gin Leu Lys 
125 130 131 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 05 base pairs 
40 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



GAGATTCAGC TGCAGCAGTC TGGACCTGAG CTGATGAAG C CTGGGGCTTC 50 
AGTGAAGATA TCCTGCAAGG CTTCTGGTTA TTCATT CAGT AGCCACTACA 100 
TGCACTGGGT GAAGCAGAGC CATGGAAAGA GCCTTGAGTG GATTGGCTAC 15 0 
ATTGATCCTT CCAATGGTGA AACTACTTAC AACCAGAAAT TCAAGGGCAA 200 
55 GGCCACATTG ACTGTAGACA CATCTTCCAG CACAGCCAAC GTGCATCTCA 250 
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10 



15 



20 



35 



GCAGCCTGAC ATCTGATGAC TCTGCAGTCT ATTTCTGTGC AAGAGGGGAC 300 
TATAGATACA ACGGCGACTG GTTTTTCGAT GTCTGGGGNG NAGGGAC C AC 350 
GGTCACCGTC TCCTCCGCCA AAACCGACAG CCCCATCGGT CTATCCGGGC 400 
CCATC 405 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 135 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Glu lie Gin Leu Gin Gin Ser Gly Pro Glu Leu Met Lys Pro Gly 
15 10 15 

Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser 
20 25 30 



Ser His Tyr Met His Trp Val Lys Gin Ser His Gly Lys Ser Leu 

25 35 40 45 

Glu Trp lie Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 

50 55 60 

30 Asn Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser 

65 70 75 

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp 

80 85 90 



Ser Ala Val Tyr Phe Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly 
95 100 105 



Asp Trp Phe Phe Asp Val Trp Gly Xaa Gly Thr Thr Val Thr Val 
40 110 115 120 

Ser Ser Ala Lys Thr Asp Ser Pro lie Gly Leu Ser Gly Pro lie 
125 130 135 



45 (2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 
50 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

55 

CTTGGTGGAG GCGGAGGAGA CG 22 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



15 



GAAACGGGCT GTTGCTGCAC CAACTGTATT CATCTTCC 38 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GTCACCGTCT CCTCCGCCTC CAC CAAGGGC C 31 
(2) INFORMATION FOR SEQ ID NO: 54: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 



20 



25 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CTTGGTGGAG GCGGAGGAGA CG 22 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 9 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAT GCATACGCTG ATATCGTGAT GACACAGACA CCACTCTCCC 100 
TGC CTGTC AG TCTTGGAGAT CAGGCCTCCA TCTCTTGCAG ATCTAGTCAG 150 
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10 



15 



20 



25 



35 



40 



45 



50 



55 



AGCCTTGTAC ACGGTATTGG AAACACCTAT TTACATTGGT ACCTGCAGAA 200 
GCCAGGCCAG TCTCCAAAGC TCCTGATCTA CAAAGTTTCC AACCGATTTT 250 
CTGGGGTCCC AGACAGGTTC AGTGGCAGTG GATCAGGGAC AGATTTCACA 300 
CTCAGGATCA GCAGAGTGGA GGCTGAGGAT CTGGGACTTT ATTTCTGCTC 350 
TCAAAGTACA CATGTTCCGC TCACGTTCGG TGCTGGGACC AAGCTGGAGC 400 
TGAAACGGGC TGTTGCTGCA CCAACTGTAT TCATCTTCCC ACCATCCAGT 450 
GAGCAATTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 500 
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 
CAAAGAGCTT CAACAGGGGA GAGTGTTAA 729 
(2) INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
30 (B) TYPE: Amino Acid 

(D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
1 5 1° 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp He Val Met Thr Gin Thr 
20 25 30 

Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gin Ala Ser He Ser 
35 40 45 

Cvs Arg Ser Ser Gin Ser Leu Val His Gly He Gly Asn Thr Tyr 
50 55 60 

Leu His Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu 
65 70 75 

He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe 
80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Arg He Ser Arg 
95 100 105 

Val Glu Ala Glu Asp Leu Gly Leu Tyr Phe Cys Ser Gin Ser Thr 
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110 115 120 

His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys 

125 130 135 

5 

Arg Ala Val Ala Ala Pro Thr Val Phe lie Phe Pro Pro Ser Ser 

140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 
10 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 

15 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 



20 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 

200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
25 230 235 240 

Glu Cys 
242 

30 (2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: Nucleic Acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

40 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTT CT AT 50 
TGCTACAAAC GCGTACGCTG AGATTCAGCT GCAGCAGTCT GGACCTGAGC 100 
45 TGATGAAGCC TGGGGCTTCA GTGAAGATAT CCTGCAAGGC TTCTGGTTAT 150 
- TCATTCAGTA GCCACTACAT GCACTGGGTG AAGCAGAGCC ATGGAAAGAG 200 
CCTTGAGTGG ATTGGCTACA TTGATCCTTC CAATGGTGAA ACTACTTACA 250 

50 

ACCAGAAATT CAAGGGCAAG GCCACATTGA CTGTAGACAC ATCTTCCAGC 300 
ACAGCCAACG TGCATCTCAG CAGCCTGACA TCTGATGACT CTGCAGTCTA 350 
55 TTTCTGTGCA AGAGGGGACT ATAGATACAA CGGCGACTGG TTTTTCGATG 400 
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TCTGGGGCGC AGGGACCACG GTCACCGTCT CCTCCGCCTC CACCAAGGGC 450 

CCATCGGTCT TCCCCCTGGC ACCCTCCTCC AAGAGCACCT CTGGGGGCAC 50 0 

AGCGGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA CCGGTGACGG 550 

TGTCGTGGAA CTCAGGCGCC CTGAC CAGCG GCGTGCACAC CTTCCCGGCT 600 

GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TGACCGTGCC 650 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC 700 

CCAGCAACAC CAAGGTGGAC AAGAAAGTTG AGCCCAAATC TTGTGACAAA 750 

15 ACTCACACAT GA 762 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY : Linear 



10 



25 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 



Ser He Ala Thr Asn Ala Tyr Ala Glu He Gin Leu Gin Gin Ser 
30 20 25 30 

Gly Pro Glu Leu Met Lys Pro Gly Ala Ser Val Lys He Ser Cys 
35 40 45 

35 Lys Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 

50 55 60 



Lys Gin Ser His Gly Lys Ser Leu Glu Trp He Gly Tyr He Asp 
65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Lys 
80 85 90 



Ala Thr Leu Thr Val Asp Thr Ser Ser Ser Thr Ala Asn Val His 
45 95 100 105 

Leu Ser Ser Leu Thr Ser Asp Asp Ser Ala Val Tyr Phe Cys Ala 
110 115 120 

50 Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 

125 130 135 

Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 



Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
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155 I" "5 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
Y 170 175 180 

Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 



Pro 

185 



190 



His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 



10 



200 



205 21° 



15 



20 



Ser Ser Val Val Thr val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

1 235 240 



230 



Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
Y 245 250 253 

(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

30 Asp lie Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu 



10 



35 



40 



Gly Asp Gin Ala Ser He Ser Cys Arg Ser Ser Gin Ser Leu Val 
20 25 30 

His Gly He Gly Asn Thr Tyr Leu His Trp Tyr Leu Gin Lys Pro 



35 



Gly Gin Ser Pro Lys Leu Leu He Tyr Tyr Lys Val Ser Asn Arg 
50 55 60 



Phe Ser Gly Val Pro Asp Arg Phe Ser Asp Ser Gly Ser Gly Thr 
65 70 75 

45 Asp Phe Thr Leu Arg He Ser Arg Val Glu Ala Glu Asp Leu Gly 

80 85 

Leu Tyr Phe Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 
95 100 105 

50 

Ala Gly Thr Lys Leu Glu Leu Lys Arg 
110 H4 

(2) INFORMATION FOR SEQ ID NO: 60: 

55 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY : Linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 

15 10 15 

10 Gly Asp Arg Val Thr He Thr Cys Arg Ser Ser Gin Ser Leu Val 

20 25 30 



15 



20 



25 



30 



45 



His Gly He Gly Asn Thr Tyr Leu His Trp Tyr Gin Gin Lys Pro 

35 40 45 

Gly Lys Ala Pro Lys Leu Leu He Tyr Tyr Lys Val Ser Asn Arg 

50 55 60 

Phe Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr 

65 70 75 

Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp Phe Ala 

80 85 90 

Thr Tyr Tyr Cys Ser Gin Ser Thr His Val Pro Leu Thr Phe Gly 

95 100 105 

Gin Gly Thr Lys Val Glu He Lys Arg 
110 114 

(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 
35 (b) TYPE: Amino Acid 

( D ) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

40 Asp He Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val 
x 5 10 15 

Gly Asp Arg Val Thr He Thr Cys Arg Ala Ser Lys Thr He Ser 
20 25 30 



Lys Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys 
35 40 45 



Leu Leu He Tyr Tyr Ser Gly Ser Thr Leu Glu Ser Gly Val Pro 
50 50 55 60 

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 
65 70 75 

55 He Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gin 

80 85 90 
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Gin His Asn Glu Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val 
95 100 105 

5 Glu lie Lys Arg 
109 

(2) INFORMATION FOR SEQ ID NO: 62: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Glu lie Gin Leu Gin Gin Ser Gly Pro Glu Leu Met Lys Pro Gly 
15 10 15 

20 Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser 

20 25 30 

Ser His Tyr Met His Trp Val Lys Gin Ser His Gly Lys Ser Leu 
35 40 45 

Glu Trp lie Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 
50 55 60 



25 



Asn Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser 

30 65 70 75 

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp 

80 85 90 

35 Ser Ala Val Tyr Phe Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 

95 100 105 

Gly Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr 

110 115 117 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 
45 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

50 Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser 
20 25 30 



Ser His Tyr Met His Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
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35 40 45 

Glu Trp Val Gly Tyr lie Asp Pro Ser Asn Gly Glu Thr Thr Tyr 
50 55 60 

5 

Asn Gin Lys Phe Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 
65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 
10 80 85 90 

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn 
95 100 105 

15 Gly Asp Trp Phe Phe Asp Val Trp Gly Gin Gly Thr 

110 115 117 

(2) INFORMATION FOR SEQ ID NO:64: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly 
15 10 15 

30 Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Thr 

20 25 30 

Gly His Trp Met Asn Trp Val Arg Gin Ala Pro Gly Lys Gly Leu 
35 40 45 



35 



50 



Glu Trp Val Gly Met lie His Pro Ser Asp Ser Glu Thr Arg Tyr 
50 55 60 



Ala Asp Ser Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ser 

40 65 70 75 

Lys Asn Thr Leu Tyr Leu Gin Met Asn Ser Leu Arg Ala Glu Asp 

80 85 90 

45 Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly lie Tyr Phe Tyr Gly 

95 100 105 

Thr Thr Tyr Phe Asp Tyr Trp Gly Gin Gly Thr 

110 115 116 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 
55 (B) TYPE: Amino Acid 

( D ) TOPOLOGY : Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
5 1 5 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
20 25 30 

10 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 

35 40 45 

Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Asn Thr Tyr 
50 55 60 



15 



30 



45 



Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 
65 70 75 



lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 
20 80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser 
95 100 105 

25 Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys 
125 130 135 



Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 
140 145 150 



Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 
35 155 160 165 

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 

40 Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 

185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 
200 205 210 



Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 
215 220 225 



His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
50 230 235 240 

Glu Cys 
242 

55 (2) INFORMATION FOR SEQ ID NO: 66: 
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10 



25 



40 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 
20 25 30 



Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
15 35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

20 Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 

65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
80 85 90 



Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 



Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
30 110 115 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
125 130 135 

35 Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

140 145 150 



Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 

170 175 180 



Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

45 185 190 195 

- His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 

200 205 210 

50 Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 

215 220 225 

Tyr lie Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 240 



Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
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245 250 253 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr Glu Lys Met 
15 10 15 

Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala Asp Glu Asn 
20 25 30 

Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser Val Ala Thr 
35 40 45 

Asp Tyr Gly Ala Ala lie Asp Gly Phe lie Gly Asp Val Ser Gly 
50 55 60 

Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Ser 
65 70 75 

Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser Pro Leu 
80 85 90 

Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin Ser . Val 
95 100 105 

Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe 
110 us 120 

Ser lie Asp Cys Asp Lys lie Asn Leu Phe Arg Gly Val Phe Ala 
125 130 135 

Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser Thr Phe 
140 145 150 

Ala Asn lie Leu Arg Asn Lys Glu Ser 
155 159 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
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TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 

5 

AGCTTAGTAC ATGGTATAGG TAACACGTAT TTACACTGGT ATCAACAGAA 2 00 
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 2 50 
10 CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 3 00 
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 4 00 

15 

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 4 50 
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
20 CTATC CCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 5 50 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 6 50 

25 

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 7 00 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 

30 CATCGTGGCC CTAGTACGCA ACTAGTCGTA 78 0 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 



Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
45 20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 
35 40 45 

50 Cys Arg Ser Ser Gin Ser Leu Val His Gly lie Gly Asn Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 
65 70 75 



lie Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 
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80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 
95 100 1° 5 

' Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 
125 130 i35 

Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
140 145 150 



15 



Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 



155 



160 165 



20 



Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 

Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 



Ala Leu 

185 



190 195 



25 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 
200 205 210 

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 
215 220 225 

30 His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 

230 235 240 



Glu Cys 
242 



35 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 253 amino acids 
40 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

45 Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
1 5 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 



50 



55 



Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 
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Lys Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 
65 70 75 

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 
80 85 90 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 H5 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
125 130 135 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 145 150 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 
20 155 1^0 165 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 180 



15 



25 



30 



Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

Tvr He Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 
35 230 235 240 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr 
245 250 253 

40 (2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
45 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
50 1 5 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp He Gin Met Thr Gin Ser 
20 25 30 

55 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 
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10 



25 



40 



45 



50 



55 



Cys Arg Ser Ser Gin Ser Leu Val His Gly He Gly Ala Thr Tyr 

50 55 60 

Leu His Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu 

65 70 75 

He Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe 

80 85 90 

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser 

95 100 105 



Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gin Ser Thr 

15 110 115 120 

His Val Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He Lys 

125 130 135 

20 Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 

140 145 150 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn 

155 160 165 



Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn 
170 175 180 



Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 
30 185 190 195 

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser 
200 205 210 

35 Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr 

215 220 225 

His Gin Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly 
230 235 240 



Glu Cys 
242 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys 
15 10 15 

Gin Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His 
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20 



25 30 



Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 
35 40 45 

5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 base pairs 
10 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 



20 



30 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
AGCTTAGTAC ATGGTATAGG TGCTACGTAT TTACACTGGT ATCAACAGAA 200 
25 ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250 
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 3 50 
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400 
TCAAACGAAC TGTGG CTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 
35 GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 

40 

TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGC CTGAGC TCGCCCGTCA 700 
45 CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
(2) INFORMATION FOR SEQ ID NO: 74: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE : Nucleic Acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50 

5 

TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100 
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150 
10 CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200 
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250 
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300 

15 

TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350 
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 4 00 
20 ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 4 50 
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550 

25 

TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600 
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650 
30 CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCGTG 8 00 

35 

CCCAGCACCA GAACTG CTGG GCGGCCGCAT GAAACAGCTA GAGGACAAGG 8 50 

TCGAAGAGCT ACTCTCCAAG AACTACCACC TAGAGAATGA AGTGGCAAGA 900 

40 CTCAAAAAGC TTGTCGGGGA GCGCTAA 927 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 298 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

50 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Gin Ser 
55 20 25 30 
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Gly 



Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 



35 



40 



Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val 
50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr He Asp 
65 70 75 



15 



30 



Pro 



10 Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gin Lys Phe Lys Gly Arg 

80 85 yo 

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala 
110 I" 120 

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp 
20 125 I 30 

Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
140 "5 I 50 

25 Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly 

155 160 i65 

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu 
170 175 

val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
185 190 195 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu 
35 200 205 210 

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
215 220 225 

40 Tyr He Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp 

230 235 

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro 
245 250 255 

45 Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys Gin Leu 

260 265 270 

Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu 
50 275 280 285 

Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 
290 295 298 

55 (2) INFORMATION FOR SEQ ID NO: 76: 



-180- 



10 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6563 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50 
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100 
GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 15 0 
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200 
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 250 
GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300 
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 350 
ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GAATTCGAGC 4 00 
TCGGTACCCG GGGATCCTCT CGAGGTTGAG GTGATTTTAT GAAAAAGAAT 450 
ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG CTACAAACGC 500 
ATACGCTGAT ATCCAGATGA CCCAGTCCCC GAGCTCCCTG TCCGCCTCTG 550 
TGGGCGATAG GGTCACCATC ACCTGCAGGT CAAGTCAAAG CTTAGTACAT 600 
GGTATAGGTG CTACGTATTT ACACTGGTAT CAACAGAAAC CAGGAAAAGC 650 
TCCGAAACTA CTGATTTACA AAGTATCCAA TCGATTCTCT GGAGTCCCTT 700 
CTCGCTTCTC TGGATCCGGT TCTGGGACGG ATTTCACTCT GACCATCAGC 750 
AGTCTGCAGC CAGAAGACTT CGCAACTTAT TACTGTTCAC AGAGTACTCA 800 
TGTCCCGCTC ACGTTTGGAC AGGGTACCAA GGTGGAGATC AAACGAACTG 850 
TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 900 
TCTGGAACTG CTTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA 950 
GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 1000 
AGGAGAGTGT CACAGAGCAG GACAGCAAGG ACAGCACCTA CAGCCTCAGC 1050 
AGCACCCTGA CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC 1100 
CTGCGAAGTC ACCCATCAGG GCCTGAGCTC GCCCGTCACA AAGAGCTTCA 1150 
ACAGGGGAGA GTGTTAAGCT GATCCTCTAC GCCGGACGCA TCGTGGCCCT 1200 
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45 



50 
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AGTACGCAAC TAGTCGTAAA AAGGGTATCT AGAGGTTGAG GTGATTTTAT 1250 
GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG 1300 

5 

CTACAAACGC GTACGCTGAG GTTCAGCTAG TGCAGTCTGG CGGTGGCCTG 1350 
GTGCAGC C AG GGGGCTCACT CCGTTTGTCC TGTGCAGCTT CTGGCTACTC 14 00 
10 CTTCTCGAGT CACTATATGC ACTGGGTCCG TCAGGCCCCG GGTAAGGGCC 14 50 

TGGAATGGGT TGGATATATT GATCCTTCCA ATGGTGAAAC TACGTATAAT 1500 
CAAAAGTTCA AGGGCCGTTT CACTTT AT CT CGCGACAACT C C AAAAAC AC 1550 

15 

AGCATACCTG CAGATGAACA GCCTGCGTGC TGAGGACACT GCCGTCTATT 1600 
ACTGTGCAAG AGGGGATTAT CGCTACAATG GTGACTGGTT CTTCGACGTC 1650 
20 TGGGGTCAAG GAACCCTGGT CACCGTCTCC TCGGCCTCCA CCAAGGGCCC 1700 

ATCGGTCTTC CCCCTGGCAC CCTCCTCCAA GAGCACCTCT GGGGGCACAG 1750 
CGGCCCTGGG CTGCCTGGTC AAGGACTACT TCCCCGAACC GGTGACGGTG 1800 

25 

TCGTGGAACT CAGGCGCCCT GACCAGCGGC GTGCACACCT TCCCGGCTGT 18 50 
CCTACAGTCC TCAGGACTCT ACTCCCTCAG CAGCGTGGTG ACCGTGCCCT 1900 
30 CCAGCAGCTT GGGCACCCAG ACCTACATCT G C AACGTGAA TCACAAGCCC 1950 

AGCAACACCA AGGTCGACAA GAAAGTTGAG CCCAAATCTT GTGACAAAAC 2 00 0 
TCACACATGC CCGCCGTGCC CAGCACCAGA ACTGCTGGGC GGCCGCATGA 20 5 0 

35 

AACAGCTAGA GGACAAGGTC GAAGAGCTAC TCTCCAAGAA CTACCACCTA 210 0 
GAGAATGAAG TGGCAAGACT CAAAAAGCTT GTCGGGGAGC GCTAAGCATG 215 0 
40 CGACGGCCCT AGAGTCCCTA ACGCTCGGTT GCCGCCGGGC GTTTTTTATT 2200 

GTTAACTCAT GTTTGACAGC TTATCATCGA TAAGCTTTAA TGCGGTAGTT 2250 
TATCACAGTT AAATTGCTAA CGCAGTCAGG C AC CGTGTAT GAAATCTAAC 2300 

45 

AATGCGCTCA TCGTCATCCT CGGCACCGTC ACCCTGGATG CTGTAGGCAT 2350 
AGGCTTGGTT ATGCCGGTAC TGCCGGGCCT CTTGCGGGAT ATCGTCCATT 2400 
50 CCGACAGCAT CGCCAGTCAC TATGGCGTGC TGCTAGCGCT ATATG CGTTG 2450 

ATGCAATTTC TATGCGC AC C CGTTCTCGGA GCACTGTCCG ACCGCTTTGG 2500 
CCGCCGCCCA GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTACG 2550 

55 

CGATCATGGC GACCACACCC GTCCTGTGGA TCCTCTACGC CGGACGCATC 2600 
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GTGGCCGGCA TCACCGGCGC CACAGGTGCG GTTGCTGGCG CCTATATCGC 265 0 
CGACATCACC GATGGGGAAG ATCGGGCTCG CCACTTCGGG CTCATGAGCG 2700 
5 CTTGTTTCGG CGTGGGTATG GTGGCAGGCC CCGTGGCCGG GGGACTGTTG 275 0 

GGCGCCATCT CCTTGCACGC ACCATTCCTT GCGGCGGCGG TGCTCAACGG 2800 
10 CCTCAACCTA CTACTGGGCT GCTTCCTAAT GCAGGAGTCG CATAAGGGAG 2850 
AGCGTCGTCC GATGCCCTTG AGAGCCTTCA AC C CAGTCAG CTCCTTCCGG 2900 
TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT 2950 
15 CATGCAACTC GTAGGACAGG TGCCGGCAGC GCTCTGGGTC ATTTTCGGCG 3000 

AGGAC CGCTT TCGCTGGAGC GCGACGATGA TCGGCCTGTC GCTTGCGGTA 3 050 
20 TTCGGAATCT TGCACGCCCT CGCTCAAGCC TTCGTCACTG GTCCCGCCAC 3100 
CAAACGTTTC GGCGAGAAGC AGGC CATTAT CGCCGGCATG GCGGCCGACG 3150 
CGCTGGGCTA CGTCTTGCTG GCGTTCGCGA CGCGAGGCTG GATGGCCTTC 3200 
CCCATTATGA TTCTTCTCGC TTCCGGCGGC AT CGGGATG C CCGCGTTGCA 32 50 
GGCCATGCTG TCCAGGCAGG TAGATGACGA CCATCAGGGA CAGCTTCAAG 3 3 00 
GATCGCTCGC GGCTCTTACC AGCCTAACTT CGATCACTGG ACCGCTGATC 3 3 50 
GTCACGGCGA TTTATGCCGC CTCGGCGAGC ACATGGAACG GGTTGGCATG 34 00 
GATTGTAGGC GCCGCCCTAT ACCTTGTCTG CCTCCCCGCG TTGCGTCGCG 3450 
GTGCATGGAG CCGGGCCACC TCGACCTGAA TGGAAGCCGG CGGCACCTCG 3500 
CTAACGGATT CACCACTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 3550 
ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC 3600 
CATCTCCAGC AGCCGCACGC GGCGCATCTC GGGCAGCGTT GGGTCCTGGC 3650 
CACGGGTGCG CATGATCGTG CTCCTGTCGT TGAGGACCCG GCTAGGCTGG 3700 
CGGGGTTGCC TTACTGGTTA GCAGAATGAA TCACCGATAC GCGAG CGAAC 3 750 
GTGAAGCGAC TGCTGCTGCA AAACGTCTGC GACCTGAGCA ACAACATGAA 3800 
TGGTCTTCGG TTTCCGTGTT TCGTAAAGTC TGGAAACGCG GAAGTCAGCG 3850 
CCCTGCACCA TTATGTTCCG GATCTGCATC GCAGGATGCT GCTGGCTACC 3900 
CTGTGGAACA CCTACATCTG TATTAACGAA GCGCTGGCAT TGACCCTGAG 3950 
TGATTTTTCT CTGGTCCCGC CGCATCCATA CCGCCAGTTG TTTACCCTCA 4000 
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CAACGTTCCA GTAACCGGGC ATGTTCATCA TCAGTAACCC GTATCGTGAG 4050 
CATCCTCTCT CGTTT CATC G GTATCATTAC CCCCATGAAC AGAAATTCCC 410 0 

5 

CCTTACACGG AGGCATCAAG TGACCAAACA GGAAAAAACC GCCCTTAACA 4150 
TGGCCCGCTT TATCAGAAGC CAGACATTAA CGCTTCTGGA GAAACTCAAC 42 00 
10 GAGCTGGACG C GGATGAAC A GGCAGACATC TGTGAATCGC TTCACGACCA 4250 
CGCTGATGAG CTTTACCGCA GCTGCCTCGC GCGTTTCGGT GATGACGGTG 4300 
AAAACCTCTG ACACATGCAG CTC CCGGAGA CGGTCACAGC TTGTCTGTAA 4350 

15 

GCGGATGCCG GGAGCAGACA AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 4400 
CGGGTGTCGG GGCGCAGCCA TGACCCAGTC ACGTAGCGAT AGCGGAGTGT 4450 
20 ATACTGGCTT AACTATGCGG CATCAGAGCA GATTGTACTG AGAGTGCACC 4500 
ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA ATACCGCATC 4550 
AGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 4600 

25 

GCTGCGGCGA G CGGT ATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA 4650 
CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA 4 700 
30 AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 4750 
CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 4800 
GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 4850 

35 

CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTAC CGG AT ACCTGTCCGC 4 900 
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT 4 950 
40 ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 5000 
CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 5050 
GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA 5100 

45 

ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 5150 
TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 5200 
50 TCTG CTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG 5250 
GCAAACAAAC CACCGCTGGT AG CGGTGGTT TTTTTGTTTG CAAGCAGCAG 5300 
ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC 5350 

55 

GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 54 00 
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TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA 5450 
AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 5500 

5 

CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT 5550 
CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG 5600 
10 GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 5650 
ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC 5700 
GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 5750 

15 

TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 5800 
TGTTGCCATT GCTGCAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG 5850 
20 CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC 5900 

ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 5950 
AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA 6 000 

25 

ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG 6050 
TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 6100 
30 TTGCCCGGCG TCAACACGGG ATAATACCG C GCCACATAGC AGAACTTTAA 6150 
AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC 6200 
TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG 6250 

35 

ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG 6300 
GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 6350 
40 ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 6400 
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 6450 
TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA 6500 

45 

ACCATTATTA TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC 6550 
CTTTCGTCTT CAA 6563 
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